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MICROPROCESSOR 

The present invention relates to a simplified instruction set microprocessor. More 
particularly, though not necessarily, the invention relates to a microprocessor which may be 
used to implement a version of the Java Virtual Machine. 

High level language oriented computer architectures have been known for many years, but 
until now few of them have been successful commercially. Currently, the most popular 
computer architectures are either general-purpose Complex Instruction Set Computers 
(CISC) such as the Intel 80x86 family of microprocessors, or general-purpose Reduced 
Instruction Set Computers (RISC). 

The success of the Java programming language, and the demands which the Java 
environment places on an execution platform, may change this situation. This means that 
language specific architectures may in the future become more popular as a Java-oriented 
architecture is likely to be more efficient in executing Java than a general-purpose 
microprocessor. 

Java is a young technology and has been designed to run on a variety of computing 
platforms, but the benefits of using dedicated special-purpose architectures have been 
apparent from the outset. SUN Microsystems™, the originators of Java, have developed a 
microprocessor core called picoJava™. This core is targeted toward the high-end of the 
computer market, i.e. it is intended for use in large desktop computer type applications. 
picoJava™ is large, complex and power-hungry, making it unsuitable for smaller, 
embedded types of applications. 

Certain similarities between the Java Virtual Machine and the Forth programming language 
have led manufacturers of Forth chips to re-brand their chips as Java chips. While some 
Forth chips are better at implementing Java than other general-purpose microprocessors, 
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differences between the Java Virtual Machine and Forth ensure that a dedicated Java 
microprocessor will generally outperform a Forth microprocessor. 

Embedded systems often operate under stringent real-time constraints. In particular, the 
interrupt latency of a processor is a critical parameter. With complex microcoded 
instructions, interrupt latency can be on the order of several dozen or even hundreds of 
clock cycles. 

It is an object of the present invention to provide a high-performance, virtual machine 
tailored microprocessor with a reduced transistor count compared to other processors, such 
as existing Java processors. 

It is also an object of this invention to provide a high-performance microprocessor, which 
has the low interrupt latency necessary in many real-time embedded systems. 

It is a further object of the invention to provide extremely good code density for high-level 
language generated programs. 

It is a further object of the invention to simplify the translation process from a machine 
independent bytecode representation of a program, such as the Java Virtual Machine 
bytecodes, to the microprocessor's instruction set, so that "on the fly" application program 
loaders, such as the Java class loaders, can be implemented at minimal cost. 

It is a further object of the invention to provide a means for the microprocessor to 
communicate with other microprocessors via a local area network, and to enable software 
upgrades to be performed remotely over the network. 

It is a further object of the invention to provide the means for the microprocessor to 
interface directly to a dedicated slave co-processor (such as a numeric co-processor, a 
digital signal processor (DSP), a special purpose communication processor, etc.) so that 
special purpose systems can be easily implemented. 
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According to a first aspect of the present invention there is provided a microprocessor 
system comprising: 

a central processing unit; 

an instruction memory for storing a sequence of instructions which correspond 
either to a fixed and predefined operation or to a user defined operation; and 

means for fetching each stored instrucion in turn and for analysing each instruction 
to determine whether an instruction corresponds either to a fixed and predefined operation 
or to a user defined operation and, in the former case, for passing the instruction to the 
central processing unit for execution or, in the latter case, for calling a subroutine 
corresponding to the instruction. 

Preferably, instructions corresponding to fixed and predefined operations are distinguished 
from instructions corresponding to user defined operations by a bit in a predefined bit 
position of the instruction code. Thus, the means for analysing each stored instruction is 
arranged to check for the presence of a bit in that bit position. 

Preferably, the microprocessor system comprises a data memory arranged in use to store 
code defining said subroutines. More preferably, the system comprises means for 
generating an address corresponding to the location of a subroutine if an instruction 
corresponds to user defined operations. Where said distinguishing bit is the most 
significant bit of the instruction code, this means is arranged to shift the code to the left by 
one or more bits. Preferably, the microprocessor comprises a program counter register 
which is arranged to load the bit shifted instruction. 

Preferably, the microprocessor system comprises a hardware stack arranged in use to store 
a return address when a subroutine is entered, the return address pointing to the next 
instruction in the instruction memory when execution of the subroutine is completed. 

Preferably, the central processing unit, instruction memory, data memory, hardware stack, 
and program counter are all coupled to a common bus. More preferably, all of these 
components including the bus are integrated onto a single chip. 
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Preferably, the instruction memory is arranged to hold 8-bit wide instructions, whilst the 
data memory is arranged to hold 32-bit data values. 

Preferably, the central processing unit contains an arithmetic logic unit and a data stack. 
The top two elements of the data stack are connected to the inputs of the arithmetic logic 
unit and the output of the arithmetic logic unit is connected to an internal data bus. 

Preferably, the top three elements of the data stack contain special-purpose circuits, which 
enable the efficient (single cycle) execution of seven primitive stack operations directly in 
hardware. The remaining data stack elements are simple shift registers. 

Preferably, the microprocessor system has a means for recognising a "fast return" 
instruction "folded" with a regular instruction, by utilising circuitry which decodes the "fast 
call" bit in the 8-bit bytecode and another dedicated bit (or bits) in the 8-bit bytecode. 
More preferably, this other bit (or bits) is (are) the second (and subsequent) most significant 
bit(s) in the 8-bit bytecode. 

Preferably, the microprocessor contains a dedicated register called the Parameter Pool 
Pointer, together with associated circuitry and several dedicated instructions for storing and 
accessing 32-bit quantities in data memory using short (8-bit or 16-bit) offsets. This 
mechanism allows the efficient implementation of local (dynamic) variables in block and 
object oriented languages. 

Preferably, the microprocessor contains a dedicated register called the Global Constant 
Pool Pointer, together with associated circuitry and dedicated instructions for storing and 
accessing 32-bit quantities in data memory using short (8-bit or 16-bit) offsets. This 
mechanism allows the efficient implementation of 32-bit literal constants, which are global 
to all execution contexts, using only an 8-bit or 16-bit bytecode extension. 

Preferably, the microprocessor contains a dedicated register called the Local Constant Pool 
Pointer, together with associated circuitry and dedicated instructions for storing and 
accessing 32-bit quantities in data memory using short (8-bit or 16-bit) offsets. This allows 
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the efficient implementation of 32-bit literal constants, which are local to a particular 
execution context. 

Preferably, the microprocessor contains a dedicated register called the Extension Stack 
Pointer, together with dedicated circuits, which is used to spill data stack elements into data 
memory, and to refill the data stack from data memory. 

Preferably, the microprocessor contains dedicated circuitry, to efficiently implement a 
subroutine call via an in-memory jump table, which is essential for an efficient 
implementation of dynamic method dispatch in object oriented programming languages. In 
a preferred embodiment of this invention this language would be the Java programming 
language. 

Preferably, the microprocessor contains dedicated circuitry and instruction to improve the 
efficiency of exception handlers. 

Preferably, the microprocessor contains dedicated hardware mechanisms to allow the 
efficient implementation of a variety of garbage-collection algorithms, which are essential 
in many object-oriented systems, such as Java. 

Preferably, the microprocessor contains circuits for interfacing the microprocessor to a data 
network and to allow the microprocessor's software to be dynamically upgraded over that 
network. 

Preferably, the microprocessor contains a means for communicating with a special-purpose 
co-processor, such as a DSP processor or a math processor. The co-processor can be 
integrated on the same silicon die as the microprocessor, or can be located off-chip. 

According to a second aspect of the present invention there is provided a method of 
processing a set of instructions in a microprocessor system, the method comprising: 

extracting instructions in sequence from an instruction memory, where said 
instructions correspond either to a fixed and predefined operation or to a user defined 
operation; 
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analysing each extracted instruction to determine whether the instruction 
corresponds either to a fixed and predefined operation or to a user defined operation; 

in the former case, passing the instruction to the central processing unit for 
execution; and 

in the latter case, calling a subroutine corresponding to the instruction. 

According to a third aspect of the present invention there is provided a microprocessor 
system comprising a central processing unit, an 8-bit wide bytecode memory, a 32-bit data 
memory, and busses connecting the central processing unit with the two memories. 

For a better understanding of the present invention and in order to show how the same may 
be carried into effect reference will now be made, by way of example, to the accompanying 
drawings, in which: 

Figure 1 illustrates schematically a microprocessor system; 

Figure 2 illustrates schematically a fast subroutine call mechanism of the system of Figure 

i; 

Figure 3 illustrates schematically a the folding of a subroutine return operation in the 
system of Figure 1 ; 

Figure 4 illustrates schematically immediate operand decode logic of the system of Figure 
1; and 

Figure 5 illustrates schematically a data stack of the system of Figure 1. 

Figure 1 shows a block diagram of a microprocessor 10. It is seen that the microprocessor 
10 has a simple architecture. The central processing unit is connected to two memories: an 
8-bit wide Bytecode Memory 100 and a 32-bit wide Data Memory 1 14. The CPU contains 
a main Arithmetic Logic Unit 103 and two hardware stacks: the Return Stack 104 and the 
Data Stack 1 08. The top two elements of the Data Stack 108 are connected directly to the 
inputs of the ALU 103. The CPU also contains an 8-bit Instruction Register 101 for 
latching the currently executing instruction bytecode, and an Immediate Operand circuit 
102 which connects the output of the Bytecode Memory to the CPU's Internal Data Bus 
115. The CPU also contains a Program Counter register 105, which is connected to the 
input of the Bytecode Memory Address ALU 1 06 and also to the input and output ports of 
the Return Stack 104. In addition to the above, the CPU also contains four dedicated 
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address registers, namely the Global Constant Pool Pointer 109, the Local Constant Pool 
Pointer 110, the Extension Stack Pointer 111, and the Parameter Pool Pointer 1 12. The 
read ports of the address registers are connected to the input of the Data Memory Address 
ALU 113. 

Figure 2 shows a block diagram of the fast subroutine call mechanism. The microprocessor 
instruction fetch logic includes a circuit which distinguishes a bit 150 (the most significant 
bit in a preferred embodiment of the invention) in the 8-bit instruction register 101 which 
latches a bytecode fetched from a bytecode memory 100. If the bit 150 is active, the 
microprocessor program counter is loaded with the output of the Address Generator circuit 
151 which uses the remaining 7 bits of the bytecode to produce an address. In a preferred 
embodiment of the invention, the circuit 151 shifts the low 7 bits left by two bits. The 
program counter register load is determined by the control signal on gate 152. At the same 
time, the current value of the program counter 105 is stored in the on-chip return stack 104. 
This sequence of actions is performed in a single clock cycle, and is equivalent to a fast 
subroutine call to one of 128 possible locations. 

Figure 3 shows the block diagram of the circuit implementing the folding of a subroutine 
return operation with a regular instruction. The microprocessor instruction decode logic 
includes a circuit which tests two bits 150, 200 in the 8-bit instruction register 101 which 
latches a bytecode fetched from bytecode memory, and controls a circuit 201, which 
reloads the program counter register from the top of the return stack. In a preferred 
embodiment of the invention the two bits would be the two most significant bits in the 8-bit 
bytecode. This action is performed in parallel with normal instruction execution, and 
means that any (regular) instruction of the microprocessor can be "folded" with a return 
from subroutine operation, effectively providing a zero-overhead operation. 

The fast call/fast return features of the microprocessor allow very short instruction 
sequences to be encoded as 8-bit user-defined bytecodes. This feature is a key to providing 
good code density and also good interrupt latency, since the "macro" instructions are 
composed of a sequence of simple (RISC-like) machine instructions of the microprocessor, 
and can be interrupted without problems. 
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The instruction fetch logic of the microprocessor includes a circuit, shown in Figure 4, 
which allows a bytecode to be followed by a single 8-bit immediate value. This immediate 
value is read from bytecode memory 1 00 and latched in the immediate operand module 
102. Depending on the bytecode, this 8-bit value can be combined with one of the address 
registers 109, 1 10, 1 1 1, 1 12, shown in Figure 4 as reference numeral 250, to provide an 
address into data memory, from which a full 32-bit immediate value is fetched. 

The organisation of the Data Stack is shown in Figure 5. The top three data stack entries 
300 are different from the remaining stack entries 301 and are provided with special- 
purpose circuits, which enable them to execute seven primitive stack manipulation 
operations directly in hardware in one clock cycle. The top two elements of the Data Stack 
are connected to the inputs of the microprocessor's Arithmetic Logic Unit 103. The stack 
manipulation primitives have been selected to either directly correspond to some Java 
Virtual Machine stack manipulation instructions, and to allow the composition of the 
remaining Java Virtual Machine instructions from two or three primitives. The primitive 
operations are: 

• POP Remove the top stack element 

• DUP Duplicate the top stack element 

• OVER Copy the second stack element over the top stack element 

• SWAP Swap the top two stack elements 

• LROT Rotate the top 3 stack elements left 

• RROT Rotate the top 3 stack elements right 

• TOVER Copy the third stack element over the top stack element 

The remaining Data Stack elements 301 are implemented as a simple array of 32 by n shift 
registers. 

Object-oriented languages, such as Java, require the efficient implementation of local (or 
dynamic) variables. The microprocessor here described supports the concept of a 
Parameter Pool A parameter pool is an area of data (32-bit) memory, with individually 
addressable locations. The addresses of the individual locations are small positive integers. 
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The Parameter Pool is implemented using a dedicated pointer register called the Parameter 
Pool Pointer register. Hardware mechanisms are provided for transferring data from the 
data stack to the Parameter Pool, and for accessing and storing individual elements in the 
pool using either a short (8-bit) offset in the bytecode stream, or a 32-bit offset from the top 
of the data stack. 

A 32-bit architecture requires 32-bit literal constant operands to be included in the 
instruction set. Current practice in bytecoded architectures is to embed the literal constant 
in the bytecode stream. This increases the code size, and makes the instruction decoding 
circuits more complex. The approach taken with the present microprocessor is the 
provision of constant pools, which hold the values of the literal constants. Two pools are 
provided for each execution context. A Global Constant Pool contains literal constants 
common to all execution contexts (the most commonly occurring constants, plus constants 
for the operation of the virtual machine). A Local Constant Pool contains literal constants 
specific to a particular execution context. The constant pools are implemented using two 
dedicated pointer registers: the Global Constant Pool Pointer register and the Local 
Constant Pool Pointer register. Hardware mechanisms are provided to enable the 
initialisation of the constant pools and the efficient retrieval of any constant from the pools 
using an 8-bit offset in the bytecode stream, or a 32-bit offset from the top of data stack. 

The on-chip data stack is used for expression evaluation. In a preferred embodiment of the 
invention the depth is equal to 8, which is adequate for most situations. The (relatively) 
shallow depth of the data stack makes context switching faster, since in the case in question 
only at most 8 elements need to be spilled into memory. In some cases, the number of 
elements on the data stack may exceed 8, causing stack overflow. To deal with this 
situation a dedicated register, called the Extension Stack Pointer is provided, together with 
dedicated circuits for loading/storing the bottom element of the data stack in data memory, 
according to the address stored in the Extension Stack Pointer. 

Object oriented languages use dynamic method dispatch very extensively. The present 
microprocessor implements dynamic method dispatch using subroutine calls via a jump 
table. This operation is performed using a built-in instruction of the microprocessor. 
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Many modern programming languages, such as Java, include facilities for defining 
exception handlers. The present microprocessor has instructions and dedicated circuits to 
improve the efficiency of the implementation of exception handlers. 

Modern high-level programming languages, such as Java, use sophisticated memory 
management techniques based on garbage collection. The present microprocessor contains 
circuits, which allow the efficient implementation of a variety of garbage collection 
algorithms, for different application areas. 

Since the "semantic gap" between the microprocessor here described and typical stack- 
based virtual machine architectures is small, it is possible to implement very simple 
dynamic loaders, which allow machine-independent application code (such as software 
upgrades) for the microprocessor to be downloaded over a local data network. In a 
preferred embodiment of the invention, the machine-independent code would be the Java 
class file format. For this reason, the microprocessor contains a unit for connecting the 
processor to a local area network. 

Many modem embedded systems consist of a control module and a data-processing 
module. The control module is usually implemented by a general-purpose microprocessor, 
while the data processing part is implemented by a dedicated hardware processor. 
Depending on the application area, the dedicated hardware processor can be a digital signal 
processing (DSP) processor, or a special-purpose math co-processor. To allow the present 
microprocessor to be used in such situations, special purpose instructions have been 
provided in the microprocessor, together with dedicated circuitry enabling the processor to 
directly interface to a wide variety of special-purpose co-processors. 

The microprocessor described above has the following important distinguishing features: 

• Low transistor count and low power dissipation for use in small embedded systems 

• Unique two-level architecture involving a fast micro machine and a slower macro 
machine 

• Modified Harvard architecture with an 8-bit wide bytecode memory and a 32-bit wide 
data memory. 
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• Bytecode (0-operand) instruction set for maximum code density. 

• 32-bit internal architecture. 

• A 32-bit wide hardware operand stack coupled to the ALU, with automatic fill/spill 
unit. 

• A 32-bit wide hardware return (subroutine) stack, with automatic fill/spill unit. 

• RISC-like instruction set, with most instructions executing in a single cycle. 

• Zero-overhead subroutine return instruction which can be "folded" with other 
instructions. 

• 128 user-programmable bytecodes facilitating the efficient creation of virtual machines. 

• Hardware support for the efficient execution of high-level languages, such as Java. 

• Global and local constant pools, allowing the specification of 32-bit immediate 
constants using short offsets in the bytecode stream. 

• Parameter pool, allowing the efficient implementation of local variables. 

• Hardware and instructions for dynamic method dispatch 

• Hardware and instructions for the efficient implementation of software exceptions 

• Hardware support for garbage collection algorithms 

• Interface to a local area network for dynamic upgrading of the application software 

• Hardware interface to a variety of on- and off-chip co-processors 

It will be appreciated by the person of skill in the art that various modifications may be 
made to the above described embodiments without departing from the scope of the present 
invention. 
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CLAIMS 

1 . A microprocessor system comprising: 
a central processing unit; 

an instruction memory for storing a sequence of instructions which correspond 
either to a fixed and predefined operation or to a user defined operation; and 

means for fetching each stored instrucion in turn and for analysing each instruction 
to determine whether an instruction corresponds either to a fixed and predefined operation 
or to a user defined operation and, in the former case, for passing the instruction to the 
central processing unit for execution or, in the latter case, for calling a subroutine 
corresponding to the instruction. 

2. A microprocessor system according to claim 1, wherein instructions corresponding 
to fixed and predefined operations are distinguished from instructions corresponding to user 
defined operations by a bit in a predefined bit position of the instruction code, and the 
means for analysing each stored instruction is arranged to check for the presence of a bit in 
that bit position. 

3. A microprocessor system according to claim 1 or 2, wherein the microprocessor 
system comprises a data memory arranged in use to store code defining said subroutines. 

4. A microprocessor system according to claim 3, wherein the system comprises 
means for generating an address corresponding to the location of a subroutine if an 
instruction corresponds to a user defined operations. 

5. A microprocessor system according to claim 4, wherein said distinguishing bit is the 
most significant bit of the instruction code, and the generating means is arranged to shift 
the code to the left by one or more bits. 

6. A microprocessor system according to claim 5 and comprising a program counter 
register which is arranged to load the bit shifted instruction. 
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7. A microprocessor system according to any one of claims 3 to 6, wherein the 
instruction memory is arranged to hold 8-bit wide instructions, whilst the data memory is 
arranged to hold 32-bit data values. 

8. A microprocessor system according to any one of the preceding claims and 
comprising a hardware stack arranged in use to store a return address when a subroutine is 
entered, the return address pointing to the next instruction in the instruction memory when 
execution of the subroutine is completed. 

9. A microprocessor system according to claim 8, wherein the central processing unit, 
instruction memory, data memory, hardware stack, and program counter are all coupled to a 
common bus. 

10. A microprocessor system according to claim 9, wherein the central processing unit, 
instruction memory, data memory, hardware stack, program counter, and common bus are 
integrated onto a single chip. 

11. A microprocessor system according to any one of the preceding claims, wherein the 
central processing unit contains an arithmetic logic unit and a data stack, and the top two 
elements of the data stack are connected to the inputs of the arithmetic logic unit and the 
output of the arithmetic logic unit is connected to an internal data bus. 

12. A microprocessor system according to claim 11, wherein the top three elements of 
the data stack contain special-purpose circuits, which enable the execution of seven 
primitive stack operations directly in hardware. 

13. A microprocessor system according to any one of the preceding claims and 
comprising means for recognising a "fast return" instruction "folded" with a regular 
instruction, by utilising circuitry which decodes the "fast call" bit in an 8-bit bytecode and 
another dedicated bit or bits in the 8-bit bytecode. 

14. A microprocessor system according to claim 13, wherein said other dedicated bit is 
the second most significant bit in the 8-bit bytecode. 
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15. A microprocessor system, consisting of a central processing unit, 8-bit wide 
instruction memory, 32-bit wide data memory and a hardware stack connected via an 
internal bus to the program counter register, together with an instruction decode unit which 
includes a circuit for detecting the presence of a distinguished bit in the 8-bit bytecode, 
together with a circuit for loading the remaining bits of the bytecode shifted left by a 
number of bits into the microprocessor's program counter register, while at the same time 
storing the current value of the program counter register on the aforementioned stack. 
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I Claims Nos. : 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers atl 
' ' searchable claims. 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. | y I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
^""^ restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-10, 15 



Remark on Protest j | The additional search fees were accompanied by the applicant's protest. 

| j No protest accompanied the payment of additional search fees. 
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When? 
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If no reply is filed, the international preliminary examination report will be established on the basis of this opinion. 
The final date by which the international preliminary 
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I. Basts of the opinion 

1 This opinion has been drawn on the basis of {substitute sheets which have been furnished to the receiving Office 
in response to an invitation under Article 14 are referred to in this opinion as "originally filed",): 

Description, pages: 

1-11 as originally filed 

Claims, No.: 

1-15 as originally filed 

Drawings, sheets: 

1/5-5/5 as originally filed 



2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. This opinion has been established as if (some of) the amendments had not been made, since they 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 



4. Additional observations, if necessary: 



III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious), 
or to be industrially applicable have not been and will not be examined in respect of: 

□ the entire international application, 

claims Nos. 9-14, 



because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which 
not require an international preliminary examination (specify): 
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□ the description, claims or drawings (indicate particular elements belov^ or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion 
could be formed. 

B no international search report has been established for the said claims Nos. 9-14. 

V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1. Statement 

Novelty (N) Claims 1-10,15 

Inventive step (IS) Claims 
Industrial applicability (IA) Claims 

2. Citations and explanations 
see separate sheet 
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1 . ad section IV: 



For the reasons enclosed in the invitation to pay additional fees and given under 
section 2.2 below, the subject-matter of Claims 1-10 and 15, the subject-matter of 
Claims 11-12 and the subject-matter of Claims 13-14 lack unity a posteriori. 



2. ad section V: 

2.1 Reference is made to the following document: 
D1 =US-A-5,479,621. 

2.2 D1 (see in particular the abstract, column 1 , line 14 to column 2, line 53 with Fig. 1 
and column 4, lines 1 1 to 48) already disclose a data processor comprising the 
features specified in lines 2 to 9 of Claim 1, with the claimed "fixed or predefined 
operation" corresponding to the "directly executable instructions" of D1, which 
would be executed directly in hardware (see column 1, lines 41 and 42 of D1), and 
the claimed "user defined operation" corresponding to the "sub-program" of D1 
(see column 1 , lines 42 to 45 as well as column 1 , last two lines of D1 ). 

While D1 does not explicitly mention that the disclosed data processor can be 
implemented as a microprocessor system, such an implementation is self-evident 
for the skilled person given the trend towards integrated circuit realisation of data 
processors. 

Therefore, the subject-matter of Claim 1 does not involve an inventive step. 

2.3 The feature introduced in Claim 2 is described as to be known in the prior art, see 
in D1 column 1, lines 40 to 51. 

It is obvious for the skilled person that the addressable sub-program of D1 would 
be stored in a dedicated memory, e.g. a so-called "data memory" as specified in 
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Claim 3 of the present application. 

Address generation as specified in Claim 4 is known from D1, see column 1, lines 
65 to 67. 

The further implementation details introduced in Claims 5 to 10 and 15 appear to 
lie within the normal design activity of the skilled person. 

Therefore, the subject-matter of Claims 2 to 10 and 15 do not involve an inventive 
step. 
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(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 
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3. This report contains indications relating to the following items: 



II 
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III 
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IV 
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VI 
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VII 




VIII 
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Basis of the report 
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Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations suporting such statement 
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I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments (Rules 70. 16 and 70. 1 7).)\ 

Description, pages: 

1-11 as originally filed 

Claims, No.: 

1-15 as received on 06/1 1/2000 with letter of 06/1 1/2000 

Drawings, sheets: 

1/5-5/5 as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 
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□ the drawings, sheets: 

5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 



Novelty (N) 


Yes: 


Claims 


1-15 




No: 


Claims 




Inventive step (IS) 


Yes: 


Claims 


1-15 




No: 


Claims 




Industrial applicability (IA) 


Yes: 


Claims 


1-15 




No: 


Claims 





2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 
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1. ad section V: 

1.1 None of the documents cited in the International Search Report concerns a 
microprocessor system for executing Virtual Machine bytecodes which have 
been translated into respective 8 bit microprocessor instructions, wherein, in the 
event that an instruction corresponds to a fixed and predefined operation, the 
instruction is passed to the central processing unit for execution and, in the event 
that an instruction corresponds to a user defined operation, a subroutine 
corresponding to the instruction is called. 

The closest prior art known from US-A-5,479,621 (D1) is particularly adapted to a 
FORTH type language. It anticipates the principle of having both directly 
executable instructions which are executed directly in hardware and indirectly 
executable instructions which are executed as sub-programs. However, instead of 
providing the means for generating an address corresponding to the location 
of a subroutine, as claimed in Claim 1 , and the circuit for loading the remaining 
bits of the bytecode shifted left by a number of bits into the 
microprocessor's program counter register, as specified in Claim 15, the prior 
art known from D1 regards the instructions which it cannot execute directly 
directly as addresses of a sub-program. In order to have sufficient address- 
space for the sub-programs, the prior art known from D1 makes use of relatively 
long instructions, whereas the present invention provides 8-bit instructions in 
combination with address generating means, such as shifting the instruction left 
by a number of bits. 

None of the other documents cited in the International Search Report suggests 
combination of 8-bit instructions with means for generating an address 
corresponding to the location of a subroutine. 

That is, the common concept linking the subject-matter of independent Claims 1 
and 15 is not rendered obvious by the available prior art. 

1 .2 Dependent Claims 2 to 14 specify embodiments of the subject-matter of Claim 1 . 
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2. ad section VII: 

2.1 Contrary to the requirements of Rule 5.1 (a)(ii) PCT, the relevant background art 
disclosed in the document D1 is not mentioned in the description, nor is this 
document identified therein. 

2.2 Description pages 3 to 6 are not in conformity with the claimed subject-matter. 
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CLAIMS 



L A microprocessor system for executing Virtual Machine bytccodes which have 
been tra n slated into respective 8-bit microprocessor instructions which correspond 
either to a fixed and predefined operation or to a user denned operation, the system 
comprising: 



a central processing unit; 



an instruction memory for storing the sequence of 8-bit microprocessor 



instructions; 

means for fetching each stored instruction in turn and for analysing each 
instruction to determine whether an instruction corresponds either to a fixed and 
predefined operation or to a user defined operation; 

means for generating an address corresponding to the location of a subroutine 
if an instruction corresponds to a user defined operation, 

wherein, in the event that an instruction corresponds to a fixed and predefined 
operation, the instruction is passed to the central processing unit for execution and, in 
the event that an instruction corresponds to a user defined operation, a subroutine 
corresponding to the instruction is called using the generated address. 

2, A microprocessor system according to claim 1, wherein instructions 
corresponding to fixed and predefined operations are distinguished from instructions 
corresponding to user defined operations by a bit in a predefined bit position of the 
instruction code, and the means for analysing each stored instruction is arranged to 
check for the presence of a bit in that bit position. 

3, A microprocessor system according to claim 1 or 2, wherein the 
microprocessor system comprises a data memory arranged in use to store code 
defining said subroutines. 

4, A microprocessor system according to claim 2 or to claim 3 when appended to 
claim 2, wherein said distinguishing bit is the most significant bit of the instruction, 
code, and the generating means is arranged to shift the code to the left by one or more 
bits. 
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5 * A microprocessor system according to claim 4 and comprising a program 
counter register which is arranged to load the bit shifted instruction. 



6. A microprocessor system according to claim 3, wherein the instruction 
memory is arranged to hold 8-bit wide instructions, whilst the data memory is 
arranged to hold 32-bit data values. 

7. A microprocessor system according to any one of the preceding claims and 
comprising a hardware stack arranged in use to store a return address when a 
subroutine is entered, the return address pointing to the next instruction in the 
instruction memory when execution of the subroutine is completed* 

8. A microprocessor system according to claim 7* wherein the centra] processing 
unit, instruction memory, data memory, hardware stack, and program counter are all 
coupled to a common bus. 



9. A microprocessor system according to claim 8, wherein the central processing 
unit, instruction memory, data memory, hardware stack, program counter, and 
common bus are integrated onto a single chip. 



10. A microprocessor system according to any one of the preceding claims, 
wherein the central processing "unit contains an arithmetic logic unit and a data stack, 
and the top two elements of the data stack are connected to the inputs of the arithmetic 
logic unit and the output of the arithmetic logic unit is connected to an internal data 
bus. 



11, A microprocessor system according to claim 1 0, wherein the top three 
elements of the data stack contain special-purpose circuits, which enable the 
execution of seven primitive stack operations directly in hardware. 

12. A microprocessor system according to any one of the preceding claims and 
comprising means for recognising a "fast return" instruction "folded" with a regular 
instruction, by utilising circuitry which decodes the "fast call" bit in an 8-bit bytecode 
and another dedicated bit or bits in the 8-bit bytecode. 
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13. A microprocessor system according to claim 12, wherein said other dedicated 
bit is the second most significant bit in the 8-bit bytecode. 

14. A microprocessor system according to any one of the preceding claims, 
wherein said Virtual Machine bytecodes are Java bytecodes. 



15, A microprocessor system, consisting of a central processing unit, 8-bit wide 
instruction memory, 32-bit wide data memory and a hardware stack connected via an 
internal bus to the program counter register, together with an instruction decode unit 
which includes a circuit for detecting the presence of a distinguished bit in the 8-bit 
bytecode, together with a circuit for loading the remaining bits of the bytecode shifted 
left by a number of bits into the microprocessor's program counter register, while at 
the same time storing the current value of the program counter register on the 
aforementioned stack. 
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2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 
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□ the drawings, sheets: 

5. □ This report has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 



6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 1-15 

No: Claims 

Inventive step (IS) Yes: Claims 1-15 

No: Claims 

Industrial applicability (IA) Yes: Claims 1 -1 5 

No: Claims 



2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 
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INTERNATIONAL PRELIMINARY International application No. PCT/GB99/03100 
EXAMINATION REPORT - SEPARATE SHEET 

1. ad section V: 

1 . 1 None of the documents cited in the International Search Report concerns a 
microprocessor system for executing Virtual Machine bytecodes which have 
been translated into respective 8 bit microprocessor instructions, wherein, in the 
event that an instruction corresponds to a fixed and predefined operation, the 
instruction is passed to the central processing unit for execution and, in the event 
that an instruction corresponds to a user defined operation, a subroutine 
corresponding to the instruction is called. 

The closest prior art known from US-A-5,479,621 (D1) is particularly adapted to a 
FORTH type language. It anticipates the principle of having both directly 
executable instructions which are executed directly in hardware and indirectly 
executable instructions which are executed as sub-programs. However, instead of 
providing the means for generating an address corresponding to the location 
of a subroutine, as claimed in Claim 1 , and the circuit for loading the remaining 
bits of the bytecode shifted left by a number of bits into the 
microprocessor's program counter register, as specified in Claim 15, the prior 
art known from D1 regards the instructions which it cannot execute directly 
directly as addresses of a sub-program. In order to have sufficient address- 
space for the sub-programs, the prior art known from D1 makes use of relatively 
long instructions, whereas the present invention provides 8-bit instructions in 
combination with address generating means, such as shifting the instruction left 
by a number of bits. 

None of the other documents cited in the International Search Report suggests 
combination of 8-bit instructions with means for generating an address 
corresponding to the location of a subroutine. 

That is, the common concept linking the subject-matter of independent Claims 1 
and 1 5 is not rendered obvious by the available prior art. 

1 .2 Dependent Claims 2 to 1 4 specify embodiments of the subject-matter of Claim 1 . 
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INTERNATIONAL PRELIMINARY International application No. PCT/GB99/03100 
EXAMINATION REPORT - SEPARATE SHEET 



2. ad section VII: 

2. 1 Contrary to the requirements of Rule 5. 1 (a)(ii) PCT, the relevant background art 
disclosed in the document D1 is not mentioned in the description, nor is this 
document identified therein. 

2.2 Description pages 3 to 6 are not in conformity with the claimed subject-matter. 
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Dear Sirs, 

PCT Application No. PCT/GB99/03100 

Filed 17 September 1999 claiming priority from 

GB Patent Application No. 9822191.4 filed 13 October 1998 

Digital Communication Technologies Ltd 

In response to your communication dated 7 August 2000 we enclose herewith in triplicate a 
replacement set of claims pages 12 to 14. Claim 1 has been amended to relate to a 
microprocessor system for executing Virtual Machine bytecodes (page 2, 3rd paragraph). 
Claim 1 now additionally states that the system has "means for generating an address 
corresponding to the location of a subroutine if an instruction corresponds to a user defined 
operation" This means may be the Address Generator circuit referred to at page 7, lines 9 
and 10 Original claim 3 has been deleted and claims 4 to 14 renumbered 3 to 13 
respectively. A new claim 14 has been added, whilst original claim 15 remains unchanged. 

Java is designed to run on a so-called Java Virtual Machine (JVM). The JVM defines an 8- 
bit instruction set - each instruction is called a "bytecode". In practice, JVMs are emulated 
using processors such as Pentium™ processors. This requires a translation to be made 
between the 8 bit Java instructions and the 32 bit instructions used by Pentium processors. 

The starting point for this invention is a desire to provide a low cost, but still efficient, Java 
based microprocessor. This means that the memory requirements must be low, and the 
architecture simple. In order to satisfy the first requirement, we want the executable code to 
be small using only 8 bit wide instructions. In the past, it has been proposed to achieve this 
by designing a microprocessor which directly executes Java bytecode, i.e. the microprocessor 
has a physical structure and mode of operation corresponding to a JVM. 

However, many of the JVM instructions are very costly to implement in hardware. This 
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means that such a Java processor must either be complex (costly) or it must implement some 
of the complicated JVM instructions via a subroutine mechanism (which carries a speed 
penalty). 

In order to reduce the overheads of implementing JVM instructions, the inventors of the 
present invention have recognised that it is advantageous to transform the Java bytecodes into 
8 bit instructions which have a structure which allows directly executable (simple) 
instructions to be easily distinguishable from instructions which are implemented by a 
subroutine call (the translation between the Java bytecodes and the 8 bit instructions is easily 
achieved using for example a one-to-one mapping mechanism). Whilst the translation step 
itself involves some processing overheads, this is more than compensated for by the fast 
subroutine call mechanism which translation facilitates. It is the operation on translated 8 
bit instructions which provides the inventive step over the known architectures for executing 
Java. This is not taught in any of the documents cited in the International search report. 

It is not the case that designing a microprocessor which operates on a transformed instruction 
set is an obvious step to take in view of US5,479,621 (Philips). In that patent, instructions 
which require a subroutine call contain the address where the subroutine can be found. 
Subroutines necessarily contain multiple instructions located in sequential memory locations 
For example, if we have 50 subroutines each taking 8 memory locations, the subroutines will 
occupy a block of 400 memory locations. The microprocessor of the Philips patent makes 
use of relatively long instructions, i.e. 16 and 32 bits. These instructions are used to directly 
encode subroutine locations and are more than long enough for that purpose, even after an 
identifying bit has been stripped off. 

As has already been noted, the present invention uses 8 bit executable instructions . An 8 bit 
instruction can only identify 128 consecutive memory locations (since one bit is used to tag 
the instruction as a subroutine call) which is wholly insufficient to support the required 
number of subroutines. In the light of the Philips patent, the skilled person would conclude 
that it is not possible to implement a microprocessor which operates on 8 bit mstrucbons 
generated by translating Java bytecode, and would either return to the costly (or slow) JVM 
type architecture or to the expensive (in terms of memory usage) architecture using 32-bit (or 
wider) instructions. The Philips patent does not enable the skilled man to make a 
microprocessor which can execute 8 bit instructions. 

Contrary to what the skilled man would have concluded from the Phillips patent, it has now 
been recognised that the problem of limited address range in an 8 bit instruction can be 
overcome by generating a subroutine address using the instruction as a basis. More 
particularly, this generating step involves an expansion of the instruction (or rather a part ot 
the instruction). In the specific example described in the application, an instruction requiring 
a subroutine call is shifted to the left by two bits. The expanded instruction provides access 
to a 512 byte memory block and allows the addressing of 128 distinct memory locations each 



SHastings\Letters\0 



# 



separated by 4 bytes. However, other mechanisms for expanding the subroutine address 
encoded in an 8-bit instruction are conceivable. 

It is submitted that in the light of the amendments made to the claims the Office should now 
issue a favourable opinion in respect of claim 1. In the event that the examiner has any 
remaining concerns, the undersigned would appreciate the opportunity to discuss the case 
with him on the telephone. 

Please acknowledge receipt of this letter by returning the enclosed form EPO 1037. 

Yours faithfully, 
for Marks & Clerk 



Dr. Robert Lind 
Enc. 
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CLAIMS 

1 . A microprocessor system comprising: 
a central processing unit; 

an instruction memory for storing a sequence of instructions which correspond 
either to a fixed and predefined operation or to a user defined operation; and 

means for fetching each stored instrucion in turn and for analysing each instruction 
to determine whether an instruction corresponds either to a fixed and predefined operation 
or to a user defined operation and, in the former case, for passing the instruction to the 
central processing unit for execution or, in the latter case, for calling a subroutine 
corresponding to the instruction. 

2. A microprocessor system according to claim 1, wherein instructions corresponding 
to fixed and predefined operations are distinguished from instructions corresponding to user 
defined operations by a bit in a predefined bit position of the instruction code, and the 
means for analysing each stored instruction is arranged to check for the presence of a bit in 
that bit position. 

3. A microprocessor system according to claim 1 or 2, wherein the microprocessor 
system comprises a data memory arranged in use to store code defining said subroutines. 

4. A microprocessor system according to claim 3, wherein the system comprises 
means for generating an address corresponding to the location of a subroutine if an 
instruction corresponds to a user defined operations. 

5. A microprocessor system according to claim 4, wherein said distinguishing bit is the 
most significant bit of the instruction code, and the generating means is arranged to shift 
the code to the left by one or more bits. 

6. A microprocessor system according to claim 5 and comprising a program counter 
register which is arranged to load the bit shifted instruction. 
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7. A microprocessor system according to any one of claims 3 to 6, wherein the 
instruction memory is arranged to hold 8-bit wide instructions, whilst the data memory is 
arranged to hold 32-bit data values. 

8. A microprocessor system according to any one of the preceding claims and 
comprising a hardware stack arranged in use to store a return address when a subroutine is 
entered, the return address pointing to the next instruction in the instruction memory when 
execution of the subroutine is completed. 

9. A microprocessor system according to claim 8, wherein the central processing unit, 
instruction memory, data memory, hardware stack, and program counter are all coupled to a 
common bus. 

10. A microprocessor system according to claim 9, wherein the central processing unit, 
instruction memory, data memory, hardware stack, program counter, and common bus are 
integrated onto a single chip. 

11. A microprocessor system according to any one of the preceding claims, wherein the 
central processing unit contains an arithmetic logic unit and a data stack, and the top two 
elements of the data stack are connected to the inputs of the arithmetic logic unit and the 
output of the arithmetic logic unit is connected to an internal data bus. 

12. A microprocessor system according to claim 11, wherein the top three elements of 
the data stack contain special-purpose circuits, which enable the execution of seven 
primitive stack operations directly in hardware. 

13. A microprocessor system according to any one of the preceding claims and 
comprising means for recognising a "fast return" instruction "folded" with a regular 
instruction, by utilising circuitry which decodes the "fast call" bit in an 8-bit bytecode and 
another dedicated bit or bits in the 8-bit bytecode. 

14. A microprocessor system according to claim 13, wherein said other dedicated bit is 
the second most significant bit in the 8-bit bytecode. 



WO 00/22507 



14 



PCT/CB99/03100 



15. A microprocessor system, consisting of a central processing unit, 8-bit wide 
instruction memory, 32-bit wide data memory and a hardware stack connected via an 
internal bus to the program counter register, together with an instruction decode unit which 
includes a circuit for detecting the presence of a distinguished bit in the 8-bit bytecode, 
together with a circuit for loading the remaining bits of the bytecode shifted left by a 
number of bits into the microprocessor's program counter register, while at the same time 
storing the current value of the program counter register on the aforementioned stack. 



